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ABSTRACT 

We present a detailed analysis of the structure and resolved stellar populations of simulated merger 
remnants, and compare them to observations of compact quiescent galaxies at z ~ 2. We find that 
major merging is a viable mechanism to produce systems of ~ 10 11 Mq and ~ 1 kpc size, provided 
the gas fraction at the time of final coalescence is high (~ 40%), and provided that the progenitors 
are compact star-forming galaxies, as expected at high redshift. Their integrated spectral energy 
distributions and velocity dispersions are in good agreement with the observations, and their position 
in the (v ma j/a,e) diagram traces the upper envelope of the distribution of lower redshift early-type 
galaxies. The simulated merger remnants show time- and sightline-dependent M/L ratio gradients 
that result from a superposition of radially dependent stellar age, stellar metallicity, and extinction. 
The median ratio of effective radius in rest-frame V-band light to that in mass surface density is ~ 2 
during the quiescent remnant phase. This is typically expressed by a negative color gradient (i.e., red 
core), which we expect to correlate with the integrated color of the system. Finally, the simulations 
differ from the observations in their surface brightness profile shape. The simulated remnants are 
typically best fit by high (n 3> 4) Sersic indices, whereas observed quiescent galaxies at z ~ 2 tend to 
be less cuspy ((n) ~ 2.3). Limiting early star formation in the progenitors may be required to prevent 
the simulated merger remnants from having extended wings. 

Subject headings: galaxies: evolution, galaxies: formation - galaxies: structure - galaxies: stellar 
content 



1. INTRODUCTION 

Recent surveys of the high-redshift universe have 
identified a substantial population of massive quiescent 
galaxies, already in place at z > 2 (e.g., Labbe et al. 
2005; Daddi et al. 2005; van Dokkum et al. 2006; Kriek 
et al. 2006). Studies of their structural parameters have 
convincingly ruled out simple monolithic collapse models, 
in which little to no structural evolution is expected. In- 
stead, observational studies find galaxies, and especially 
quiescent galaxies, to grow significantly in size as time 
progresses (e.g., Trujillo et al. 2006; Toft et al. 2007; 
Zirm et al. 2007; van Dokkum et al. 2008; Cimatti et 
al. 2008; van der Wei et al. 2008; Franx et al. 2008; 
Buitrago et al. 2008). At z ~ 2.3, massive quiescent 
galaxies are typically 5 times more compact, and two 
orders of magnitude more dense than local ellipticals of 
the same mass (e.g., van Dokkum et al. 2008, hereafter 
vD08). Bezanson et al. (2009) note that, even though 
their effective densities (measured within one effective ra- 
dius r e ) are strikingly high compared to local ellipticals, 
the central densities measured within a fixed aperture of 
1 kpc exceed those of local ellipticals by no more than a 
factor 2-3 (see also Hopkins et al. 2009d). This obser- 
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vation suggests an inside-out growth, in agreement with 
stacking results by van Dokkum et al. (2010). 

Motivated by these recent observational developments, 
several mechanisms have been proposed to incorporate 
the constraints on structural evolution into galaxy forma- 
tion theories. In most cases, minor and/or major merg- 
ers are invoked to explain the observed size evolution 
(Khochfar & Silk 2006a; Naab et al. 2007, 2009; Hop- 
kins et al. 2009a) . Briefly, mergers were more gas-rich at 
high redshifts, and hence formed a larger fraction of their 
stars in a nuclear dissipational component, explaining 
their compact nature. Subsequent (dry) merging activ- 
ity puffs up the system without adding too much mass or 
new stars, which would violate observational constraints. 
However, alternative scenarios involving an expansion of 
the stellar distribution as response to significant mass 
losses have been suggested as well (Fan et al. 2008). In 
either case, an accurate observational characterization of 
the size-mass relation provides a crucial test for galaxy 
formation models. 

Given the paucity of kinematic mass measurements 
based on absorption- line spectra of z > 1.5 galaxies (al- 
though see Cenarro & Trujillo 2009; Cappellari et al. 
2009; van Dokkum, Kriek & Franx 2009), studies of the 
high-redshift size-mass scaling relation to date have fo- 
cussed on stellar mass estimates from spectral energy dis- 
tribution (SED) modeling (e.g., Williams et al. 2010). 
Significant systematic uncertainties related to the as- 
sumption of an IMF (e.g., van Dokkum 2008; Dave 2008; 
Wilkins et al. 2008) and the choice of a stellar popula- 
tion synthesis code (e.g., Maraston et al. 2006; Wuyts et 
al. 2007; Muzzin et al. 2009a) remain. Uncertainties re- 
lated to the star formation history, metallicity, and dust 
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attenuation are relatively modest for the quiescent pop- 
ulation, they only contribute significantly to the overall 
uncertainty during earlier, actively star-forming phases 
(Wuyts et al. 2009a). However important an accurate 
characterization of mass, we focus in this paper on the 
measurement of the second parameter of the scaling re- 
lation: galaxy size. 

Observations probe the projected distribution of light, 
sampling it by a discrete number of pixels after it was 
smeared by a point spread function (PSF). In addition, 
the signal is superposed by noise. The translation to a 
physically more meaningful mass profile involves the as- 
sumption of a mass-to-light ratio M/L. Although often 
for simplicity assumed to be a constant, spatial variations 
in M/L may occur due to age, metallicity and/or dust 
gradients. Furthermore, since the total size of a galaxy 
is ill-defined, one refers to (circularized) size as the ra- 
dius r e containing half the mass. Given the finite image 
resolution, this quantity is generally obtained by fitting 
a template profile, taking pixelization and PSF smearing 
into account. In most of the literature, a one-component 
Sersic (1968) profile has been adopted, providing satis- 
fyingly flat residual images given the noise level of the 
observations. 

Numerical simulations provide an excellent tool for the 
interpretation of galaxy structure. The simulated data 
offers a three-dimensional view of the mass, age, and 
metallicity profile at high resolution, free of sky noise 1 . 
By feeding the output to a radiative transfer code and 
producing mock observations, each of the above aspects 
related to the nature of observational data can be iso- 
lated, and its effect analyzed. For example, contrast- 
ing the light profiles of local gas-rich merger remnants 
and ellipticals with those of simulated merger remnants, 
Hopkins et al. (2008b, 2009b) demonstrated that a two- 
component profile (consisting of an inner dissipational, 
and outer violently relaxed component) provides both a 
better fit and a physically more meaningful interpreta- 
tion of their structure than a single Sersic profile. 

In this paper, we compare the structure of simulated 
merger remnants to the best observations of compact 
quiescent galaxies at z ~ 2 to date. In addition, we 
discuss the presence of M/L ratio gradients that may 
bias measurements of the half-mass radius, and can be 
revealed by multi-wavelength structural studies with the 
high-resolution cameras onboard Hubble Space Telescope. 
This study complements the comparison between obser- 
vations and merger simulations by Wuyts et al. (2009b) 
that focussed on the integrated colors, number and mass 
densities of high-redshift quiescent galaxies. 

We describe the simulations, and mock observations 
based thereupon in §2. There, we also caution for artifi- 
cial heating in simulating regimes of extreme density. We 
discuss the relation between size and mass for quiescent 
galaxies with a range of formation histories in §3, and 
address their kinematics in §4. Next, we analyze radial 
variations in the M/L ratio, their origin in terms of stel- 
lar populations, and color gradients through which they 
manifest themselves (§5). Finally, we exploit realistic 
mock observations of the simulations to fit their surface 

1 The finite number of particles introduces particle noise, but 
for the simulations analyzed in this paper this is negligible on the 
scales we study. 



brightness profiles alongside real high-redshift compact 
galaxies. §6 highlights the cuspiness of the simulated pro- 
files, and explores possible origins of an apparent profile 
mismatch with respect to the observations. We summa- 
rize the results in §7. 

Throughout this paper, we adopt the following cosmo- 
logical parameters: (Qm , Qa 5 h) = (0.3, 0.7, 0.7). 

2. THE SIMULATIONS 

2.1. Main characteristics 
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Fig. 1. — Surface mass density profiles of a compact merger 
remnant with final stellar mass of 1.7 X 10 11 Mq, simulated with 
1, 2, and 3 times the default spatial resolution adopted in this 
paper. The profiles are converged outside of a softening length. 

This work is based on a suite of smoothed particle 
hydrodynamic (SPH) simulations representing isolated 
and merging galaxies at redshifts z ~ to 3 that were 
performed with the GADGET-2 code (Springel 2005). 
The code uses an entropy-conserving formalism of SPH 
(Springel & Hernquist 2002), and includes gas cooling, a 
multi-phase model for the interstellar medium (ISM) to 
describe star formation and supernova feedback (Springel 
& Hernquist 2003), and a prescription for supermassive 
black hole growth and feedback (Springel, Di Matteo & 
Hernquist 2005; Di Matteo, Springel & Hernquist 2005). 

The progenitor disks in our simulations are embedded 
in an extended dark matter halo with a Hernquist (1990) 
profile. They span a range of baryonic masses from 7 x 
10 9 Mq to 4 x 10 11 M Q , and initial gas fractions of 20 
- 80%. We mostly focus on the subset of massive gas- 
rich merger simulations in which the progenitors were 
scaled to approximate the structure of disk galaxies at 
redshift z = 3, following Robertson et al. (2006). Briefly, 
this means that the mass- and redshift-dependent halo 
concentration measured by Bullock et al. (2001) was 
adopted: 
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where M co u q ~ 8 x 10 12 hr 1 M Q is the linear collapse 
mass at z=0. The virial mass and virial radius of the 
progenitors were scaled as follows: 
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FlG. 2. — Size evolution after final coalescence of a compact merger remnant with a final stellar mass of 1.7 x 10 11 Mq. The top 
line for each color-coded component indicates the 3D half-mass radius, whereas the light (dark) polygon below illustrates the full (central 
50-percentile) range of 2D projected half-mass radii of the system as viewed from different sightlines. Panels (a) to (d) represent realizations 
of SPH simulations with increasingly finer timestep resolution and otherwise identical conditions. Numerical heating causes an artificial 
increase in size when the orbital paths of SPH particles in a dense environment are integrated with insufficient resolution. For a system of 
~ 10 11 Mq and half- mass radius of ~ 1 kpc, the size evolution is not yet converged for the default GADGET timestep resolution. 



where V V i r is the virial velocity and H(z) is the Hub- 
ble parameter. Disk sizes were initialized according to 
the Mo et al. (1998) formalism for dissipational disk 
galaxy formation assuming the fraction of the total angu- 
lar momentum contained in the disk equals the fraction 
of the total mass contained in the disk. The disk scale 
length is then derived from the halo concentration C V i r 
(Eq. 1) and the galaxy spin A, where we adopt a default 
A = 0.033, as motivated by cosmological N-body simu- 
lations (Vitvitska et al. 2002). In practice, this means 
that the z — 3-scaled progenitor disks have effective radii 
that are a factor 1.7 smaller than similar mass galaxies 
today, in agreement with the observed size evolution of 
star- forming galaxies (Franx et al. 2008). 

Equal-mass mergers were simulated for a range of or- 
bital configurations, from coplanar to polar to various 
tilted disk orientations, following Cox et al. (2006). 
Specifically, we analyzed simulations with the range of 
spin axis orientations of the progenitor disks listed, in 
standard spherical coordinates, in Table 1. In typical 
runs, each of the two progenitor galaxies initially con- 
sists of 60000 to 120000 dark matter particles, 40000 gas 
and 40000 stellar disk particles, and one black hole sink 
particle. For the mass range probed by our simulations, 
this corresponds to typical mass resolutions of the bary- 
onic and dark matter particles of M^ al = 1 — 8 x 10 5 Mq 
and Mdm = 0.5 — 4 x 10 7 Mq respectively. In addition 
to equal-mass mergers, we also ran simulations in which 
the disk galaxy was left to evolve in isolation, simulations 
of unequal-mass mergers, and scenarios where a merger 
remnant undergoes subsequent merging. 

We adopted a gravitational softening length of e = 140 
pc. A resolution study for one of our fiducial merger 
simulations, where we increased the mass resolution by 
a factor 4 and repeated the simulations with two and 
three times the spatial resolution (i.e., adopting soften- 
ing lengths that are half or a third of our default) and 
otherwise identical conditions confirms that the adopted 
resolution is sufficient for the purposes of this paper (Fig- 
ure 1). The total stellar mass of the merger remnant 
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a Uniquc orientation identification from Cox et al. (2006). 
^Selected by Barnes (1992) to be unbiased initial disk orientations 
according to the coordinates of two oppositely directed tetrahedra. 



varies by a few percent only when increasing the spatial 
resolution. At radii larger than the softening length, the 
surface mass density profiles are converged at the 10% 
level. Within a radius of 140 pc, the amount of stel- 
lar mass assembled in the nucleus tends to be somewhat 
larger in the higher resolution runs. We furthermore re- 
fined the GADGET-2 timestep resolution by a factor 
a/10 with respect to the GADGET-2 default, resulting 
in runtimes that are longer by the same factor. Figure 
2 illustrates the importance of the latter choice when 
studying (sub)structure at high spatial densities. The 
panels illustrate four realizations of a gas-rich merger 
simulation with a final stellar mass of 1.7 x 10 11 Mq 
and half-light radius of ~ 1 kpc, identical except for 
the timestep resolution with which particle orbits are in- 
tegrated. The timestep scales with ~ y/crj, where the 
respective value of the GADGET-2 parameter 77 = Er- 
rTolIntAccuracy is shown in the panel. For each realiza- 
tion, we show the 3D (top line) and 2D (bottom polygon) 
half-mass radius evolution as function of time since the 
merger, which we define as the radius of a sphere (respec- 
tively circle) encompassing half of the (projected) stellar 
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mass. Throughout this paper, we define time of merging 
as the moment when the peak in star formation activ- 
ity is reached. Different colors represent the size evolu- 
tion of the galaxy as a whole (black), the young com- 
ponent of stars that formed within a 250 Myr interval 
around final coalescence (blue), and the old component 
of stars that were already formed prior to the nuclear 
starburst (red). Several conclusions can be drawn from 
Figure 2. First, numerical heating can lead to artificial 
growth of the simulated merger remnant. Since this ef- 
fect is not uniform over the galaxy, but manifests itself 
particularly in the dense and young central component, 
it can also artificially alter the profile shape. Further- 
more, the stellar mass formed during the nuclear star- 
burst steadily decreases when a finer timestep resolution 
is adopted, by 26% when comparing simulation (a) to (d) 
from Figure 2, and by 10% when comparing simulation 
(b) to (d). This trend suggests that, in addition to ar- 
tificial growth during the post-merger phase, the imple- 
mentation of star formation and/or feedback processes 
may also depend on the length of the integration time 
step. This result has important implications for studies 
of structural evolution of massive galaxies based on cos- 
mological simulations, where finite computational power 
imposes a delicate tradeoff between the box size (essen- 
tial for reliable number statistics at the massive end), the 
spatial resolution, and the timestep resolution. Clearly, 
by compromising the timestep resolution to a level where 
no numerical convergence is reached, spurious growth in 
size and smoothing of cusps will be superposed on galaxy 
growth by real physical processes. 

A second conclusion to draw from Figure 2 is that, 
once a converging timestep resolution is adopted, we 
find no evidence for size evolution after final coalescence 
in our binary merger simulations 2 . Third, the young 
dissipational component in this simulation that started 
with a gas fraction of 80% and was still very gas-rich 
(/gas = 0.43) at the time of final coalescence is more 
than an order of magnitude smaller than the old compo- 
nent of previously formed stars. Finally, by definition the 
2D half-mass radius (which depends on viewing angle) is 
smaller than the 3D half-mass radius. We find a typical 
r e ,3D /r e ,2D ratio of 1.4. Furthermore, we find that the 
ratio r e ^D / r e,2D depends more strongly on viewing angle 
for the young component than for the old component, im- 
plying that the recent, dissipational star formation event 
took place in a disk-like structure, whereas the violently 
relaxed component has a more spherical shape. We find 
similar properties for the merger remnants produced by 
simulations that started with different orbital configura- 
tions. 

2.2. Translating simulations to observables 

The fluxes, colors and light profiles of the merger rem- 
nants are computed from the simulation output in two 
steps. First, the intrinsic stellar emission is derived 
from a stellar population synthesis code, where we treat 
each particle as a Simple Stellar Population (SSP) with 
its stellar mass, age and metallicity computed by the 
GADGET-2 code. We adopt a Kroupa (2001) IMF and 

2 Note that this simulation does not include subsequent merging 
or gas infall, and only treats stellar mass loss in an instantaneous 
manner (see Springel & Hernquist 2003). 



compute the photometry using Bruzual & Chariot (2003) 
or Maraston (2005) models. We find that our results are 
independent of the choice of stellar population synthesis 
code. Initial conditions (stellar age and metallicity) of 
the stellar and gas particles present at the start of the 
simulation were set by a simple closed box formalism de- 
tailed by Wuyts et al. (2009a). The precise choice of 
these initial stellar population properties has a negligi- 
ble impact on the nature of the merger remnants that 
are the focus of this paper. 

In step two, we use the information on the gas dis- 
tribution and enrichment to compute the attenuating 
effect of dust on the emerging galaxy light. We per- 
form our analysis using two independent codes that are 
frequently used in the literature. The first is a line-of- 
sight attcnatuation code (LOS) that simply computes the 
wavelength-dependent absorption to each stellar particle 
from the mctallicity-weighted diffuse gas column den- 
sity between that stellar particle and the observer (see 
Hopkins et al. 2005; Robertson et al. 2007; Wuyts 
et al. 2009a,b). The second code, SUNRISE (Jons- 
son 2006; Jonsson, Groves & Cox 2010), uses a Monte 
Carlo methodology to track photon packets on their way 
through the dusty ISM, and models the effects of both 
absorption and scattering. In addition, SUNRISE uses a 
sub-grid model to account for the attenuation by birth- 
clouds (HII and photodissociation regions) that surround 
young star clusters, computed using the photoionization 
code MAPPINGS (Groves et al. 2008). Furthermore, 
SUNRISE tracks dust temperatures and re-emission of 
absorbed light at longer wavelengths (see, e.g., Younger 
et al. 2009; Narayanan et al. 2010). However, since no 
high-resolution structural information is available in the 
far-infrared, and dust re-emission is negligible during the 
quiescent merger remnant phase, this aspect of the ra- 
diative transfer is not discussed in this paper. Likewise, 
the GADGET-2 simulation keeps track of gas accretion 
onto a central supermassive black hole, but its emission 
is negligible during the quiescent merger remnant phase 
of interest. 

We compute realizations of the radiative transfer with 
attenuation laws representative of the Milky Way and the 
Small Magellanic Cloud. Unless specifically mentioned, 
our conclusions are qualitatively the same independent 
of the radiative transfer code or attenuation law used. 

2.3. Mock images 

We use the vD08 sample of compact quiescent galaxies 
at z ~ 2.3 as reference. All 9 galaxies have a spectroscop- 
ically confirmed Balmer/4000A break (Kriek et al. 2006). 
Seven of them were observed for 3 orbits in the F160W 
filter using the NIC2 camera on HST, probing their rest- 
frame optical emission. The two brighest galaxies were 
exposed for 2 orbits. At a drizzled pixel scale of 0.0378", 
the point-spread function (PSF) is well sampled, and 
the observations yielded the highest-resolution surface 
brightness maps of such galaxies to date. 

In order to establish a fair comparison, we produce 
mock observations placing the simulated merger rem- 
nants at the same rcdshift, using the same observed fil- 
ter and an identical pixel scale. We convolved the re- 
sulting postage stamps with a PSF extracted from the 
vD08 NIC2 images, applied Poisson noise, and added 
them into empty regions of the NIC2 images in order 
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to guarantee similar noise properties. We randomized 
the sub-pixel position of the simulated galaxies, but find 
that the recovered structural properties are independent 
of this treatment. 

In the following, we first analyze the true half-mass (§3) 
and half-light (§5) radii of the simulated galaxies as mea- 
sured from the full (noise-free and high-resolution) infor- 
mation available from simulation and radiative transfer. 
Next, we include the effects of pixelization, finite reso- 
lution, and limited signal-to-noise by running the two- 
dimensional fitting code GALFIT (Peng et al. 2002) on 
the real and mock NIC2 images (§6). 

3. SIMULATED MERGER REMNANTS ON THE SIZE - 
MASS RELATION 

Wuyts et al. (2009b) demonstrated that the inte- 
grated stellar population properties (rest-frame optical 
and optical-to-NIR colors, as well as specific star for- 
mation rates) of simulated merger remnants are consis- 
tent with those of observed quiescent galaxies at z <~ 2. 
Here, we investigate whether a major merger formation 
scenario can also account for the remarkable compact- 
ness of the observed quiescent systems. A useful diag- 
nostic to address this question is the size-mass relation. 
In Figure 3, we mark the local spheroid size-mass rela- 
tion (Shcn et al. 2003) with a solid line. As reliable 
size measurements of complete samples of high-redshift 
quiescent galaxies are only available over a narrow mass 
range, we indicate a fiducial size-mass relation at redshift 
z ~ 2.3 (dashed line) by assuming the z ~ slope and 
adopting the median size and mass of the vD08 sample 
as zero-point anchor. We note that both observational 
(Trujillo et al. 2006; Ryan et al. 2010) and theoretical 
(Khochfar & Silk 2006a) studies have suggested that the 
size-mass relation becomes shallower with redshift. Ryan 
et al. (2010) report a mass dependence of the power-law 
index a in the size evolution R e /R e z=0 = (1 + z)~ a of 
aw -1.8 + 1.41og(M*/10 9 M Q ). We caution that this fit 
is driven by measurements of galaxies at < z < 2, and, 
as stated by Ryan et al. (2010) , should be considered pre- 
liminary at best. Nevertheless, it is illustrative to plot 
the corresponding size-mass relation at z ~ 2.3 (dotted 
line). We conclude that the ultra-compactness of mas- 
sive (> 10 11 Mq) quiescent galaxies at z <~ 2 is a robust 
result. Tighter constraints on the sizes of high-redshift 
spheroids of lower mass will be essential to understand 
whether they formed through similar mechanisms. The 
main focus of this paper is on compact galaxies at the 
high-mass end. 

Overplotted in Figure 3, we show the 2D (i.e., pro- 
jected) half-mass radii of simulated merger remnants as 
a function of their stellar mass, as seen from 100 view- 
ing angles uniformly spread over a sphere. We consider 
merger simulations with a range of stellar masses, and 
in each case only plot the snapshots more than 100 Myr 
after the final starburst. Encircled are the remnants pro- 
duced by merging progenitors that were scaled to repre- 
sent z = 3 disks (i.e., reaching the remnant phase around 
z ~ 2.3). The other simulations started out with progen- 
itors scaled to represent local disk galaxies. For every 
simulation, we define a dissipational fraction / gas as the 
gas fraction of the system 125 Myr before the peak in star 
formation rate (SFR) is reached. Even though some of 
our simulations start with initial gas fractions as high as 



80%, /gas rarely reaches values above 50%, because rapid 
star formation in the progenitor disks, especially during 
first passage, consumes significant amounts of gas before 
final coalescence. 

Figure 3 clearly illustrates that, at a given mass, the 
size of a merger remnant is smaller when the dissipational 
fraction is higher, and even more so when the progenitor 
disks had a more compact nature to start with. Tidal 
torques are responsible for channeling large amounts of 
gas to the central region (Barnes & Hernquist 1991, 1996) 
where it is consumed in a starburst (Mihos & Hernquist 
1994, 1996). We conclude that the location of the ob- 
served z ~ 2 quiescent galaxies on the size-mass dia- 
gram (the vD08 sample has median properties r e = 0.9 
kpc, M = 1.7 x 10 11 M Q ) can straightforwardly be ex- 
plained by major merger activity provided the progeni- 
tors at high redshift were more gas-rich and had scale- 
lengths smaller than today's disk galaxies. This idea was 
first formulated by Khochfar & Silk (2006a), and is fur- 
ther detailed by Hopkins et al. (2009a; 2010a). Sig- 
nificant obervational support for increased gas fractions 
in star-forming galaxies towards higher redshift was in- 
ferred from Ha spectroscopy by Erb et al. (2006), and 
more recently confirmed on the basis of molecular line 
measurements (Baker et al. 2004; Coppin et al. 2007; 
Daddi et al. 2008; Tacconi et al. 2010). Likewise, a de- 
crease in the size of star-forming galaxies at a given mass 
is observationally well established (Trujillo et al. 2006; 
Franx et al. 2008; Williams et al. 2010). 

4. KINEMATICS 

A proper characterization of galaxy structure requires 
knowledge of the mass-to-light ratio to determine the 
stellar mass, and of its gradient to translate the mea- 
sured half-light radius to a radius containing half the 
mass (see §5). One way to circumvent M/L ratio effects, 
is probing the central potential directly through kine- 
matics. However, measuring the velocity dispersions of 
quiescent z ~ 2 galaxies from their absorption line spec- 
tra is extremely expensive in terms of telescope time. 
To date, one such measurement (van Dokkum, Kriek & 
Franx 2009) at z > 2 has been carried out, totaling 29 
hours on a 8m class telescope. At somewhat lower red- 
shift and mass (log M = 10— 11), a stacked measurement 
of similar red nuggets at 1.6 < z < 2.0, as well as two 
individual measurements at z <~ 1.4 were presented by 
Cappellari et al. (2009). In Figure 4, we compare the 

measured velocity dispersion to the virial estimator 

where M is the stellar mass derived from SED modeling. 
The dashed line indicates the proportionality followed by 
present-day early-type galaxies (van Dokkum & Stanford 
2003; Cappellari et al. 2006). Color-coded by their gas 
fraction shortly before the final starburst, we overplot the 
same simulated merger remnants whose progenitors were 
scaled to represent high-redshift star-forming galaxies as 
displayed in Figure 3. Both the observed and simulated 
quiescent galaxies show a clear correlation between the 
measured velocity dispersion and what would be esti- 
mated based on virial arguments. Within the error bars, 
the observational results are all consistent with having 

the same scaling between \J~^- and a as early-type galax- 
ies in the nearby universe. Simulations of gas-rich merg- 
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Fig. 3. — Size-mass diagram of simulated merger remnants, color-coded by the gas fraction of the merger from which they descend, 
measured 125 Myr prior to the peak in star formation rate. Simulations where the progenitors were scaled to represent disk galaxies at 
z = 3 are encircled. The solid line indicates the size- mass relation of present-day early- type galaxies (Shen et al. 2003). The quiescent 
galaxies at z ~ 2.3 from vD08 lie on the dashed line. The dotted line indicates the size-mass relation at z ~ 2.3 based on an extrapolation of 
the mass-dependent size evolution measured by Ryan et al. 2010. At a given mass, merger remnants are more compact as the dissipational 
fraction increases. 
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Fig. 4. — Velocity dispersion of quiescent galaxies at 1.4 < z < 
2.2 as function of the virial estimator \ — based on their stellar 



mass and size. Simulated merger remnants span a similar range, 
but at a given a extend to higher values of the virial estimator 
when the dissipational fraction is large (> 40%). 



ers are able to produce remnants with similar velocity 
dispersions. At a given velocity dispersion, the virial es- 



timator ^f- is larger for runs with a higher dissipational 

fraction, implying that mergers of varying dissipational 
fraction are non-homologous (see also Robertson et al. 
2006; Hopkins, Cox & Hernquist 2008). Briefly, the scal- 
ing factor k in the relation 

(4) 

where M is the stellar mass and r e the stellar half-mass 
radius, depends on the profile shape of the stellar mass 
distribution (lower k for cuspier systems), and on the 
baryon-to-dark matter ratio within the stellar effective 
radius (lower k for more baryon-dominated centers) . Our 
simulations show that both factors are a strong func- 
tion of the dissipational fraction / gas of the merger. Sys- 
tems with larger / gas are increasingly baryon-dominated 
in their centers. Their cuspy profile shapes are discussed 
at length in Section 6.1. 

Although such measurements are not yet available ob- 
servationally at z ~ 2, it is interesting to consider the 
degree of rotation expected in high-redshift compact qui- 
escent galaxies. To this end, we study the ratio of ro- 
tational over random motion (J^sL where the rotation 

v er 1 

velocity is measured along the major axis) as a function 
of cllipticity in Figure 5. The gray-coded distribution in- 
dicates the locus occupied by quiescent merger remnants 
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Fig. 5. — (v/c, e) diagram for simulated merger remnants of 
~ 10 11 Mq and ~ 1 kpc size (gray-coded distribution). Ellipticities 
of quiescent galaxies at z ~ 2.3 with similar mass and size from 
vD08 are indicated with dashed vertical lines. Early- type galaxies 
at 2 ~ and z ~ 1 are overplotted for reference. The green 
line marks the relation expected for an oblate isotropic rotator. 
The simulations suggest that rotation may on average play a more 
dominant role in early-type galaxies at high redshift than at low 
redshift, due to the increasingly dissipational processes through 
which they formed. 



that match the vD08 galaxies in size and mass. These 
remnants are the product of gas-rich merger simulations 
with a range of orbital configurations, and we observe 
them from 100 sightlines uniformly distributed over a 
sphere. The green line marks the relation expected for 
an oblate isotropic rotator (Binney 1978): 

- = \/t^ (5) 

a V 1 — e 

Vertical dashed lines indicate the ellipticities of the 
z ~ 2.3 compact galaxies of vD08, as measured on the 
high-resolution NIC2 images. For reference, we also plot 
the location of massive early-type galaxies (of larger size 
than those at z ~ 2) at z ~ 1 (van der Wei & van der 
Marel 2008) and at z ~ (a complete sample extracted 
from the HyperLeda database by Paturel et al. 2003, as 
detailed by van der Wei & van der Marel 2008). The 
closed and open circles represent galaxies with an E and 
SO morphology respectively. 

The rotation parameter of the simulated merger rem- 
nants increases with ellipticity. At a given ellipticity, 
the compact remnants show a larger degree of rotational 
support than the bulk of early-type galaxies today. Their 
kinematics resemble those of the fastest rotators of the 
z ~ sample at each ellipticity. Cox et al. (2006) demon- 
strated that the rotation parameters and flattening of 
local spheroids with less rotation can be reproduced by 
merger simulations tuned to lower redshift, with lower 
gas fractions (see also Naab, Jesseit & Burkert 2006). 
A hint of increasing rotational support towards higher 
redshifts, where mergers are expected to be increasingly 
dissipational, may already be observed when comparing 
the measurements at z ~ 1 to those at z ~ 0. It re- 
mains to be seen whether this trend extends to z ~ 2, 



and whether similarly high values of — , of up to unity, 
are present as anticipated by our simulations. The large 
range of ellipticities in the vD08 sample, reaching values 
of e > 0.6 which are larger than found in our simula- 
tions, may point to rotation being indeed important in 
these systems. 

5. M/L RATIO GRADIENTS 

Size measurements have traditionally been carried out 
using one waveband only, typically the longest wave- 
length available at high resolution. By definition, this 
technique probes a monochromatic surface brightness 
profile. Evidently, the half-light radius derived from it 
equals the physically more meaningful half-mass radius 
only under the assumption of a spatially constant M/L 
ratio. 

Having established that gas-rich mergers are a viable 
mechanism to collect ~ 10 11 Mq of baryonic material in 
a ~ 1 kpc radius, we now use our simulations to test this 
assumption for the quiescent remnants. Using the ra- 
diative transfer methods described in §2.2, we compute 
the half-light radius in three rest-frame wavebands U, 
V, and J, and contrast it with the half-mass radius of 
the system for 100 lines of sight. Figures 6 and 7 show 
the resulting light-to- mass size ratio r e .ii g rit/ r 'e.mass as a 
function of time since the merger for the same simulation 
shown in Figure 2(c). The central line indicates the me- 
dian evolution over all viewing angles, whereas the light 
and dark shaded regions mark the central 50% and 100% 
percentiles respectively. 

Panels on the left illustrate the time evolution of 
fe, light /r e , mass ignoring attenuation by dust. In this case, 
all scatter at a given time is due to projection effects. 
The scatter is limited to a few percent in rest-frame V, 
which is the band most closely corresponding to F160W 
at z ~ 2.3. Initially, as many new stars are formed in 
the nuclear starburst, the intrinsic light profile is more 
centrally concentrated than the stellar mass profile (i.e., 
fe, light /i" e , mass <C 1). On timescales of a few hundred 
Myr, as the bright and short-lived O and B stars die 
out, the nuclear emission dims with respect to the outer 
parts of the galaxy, and the M/L gradient flattens. In 
fact, around 400 Myr after the peak in star formation 
rate, the light profile in the rest-frame optical becomes 
more extended than the mass profile. This trend is more 
pronounced as we consider shorter wavelengths, reach- 
ing r e i light /r e , mass ~ 2 in rest-frame U. The reason for 
this reversed M/L gradient is that the young stars in 
the center formed out of more metal-enriched gas than 
the stars that constitute most of the galaxy outskirts, 
making them fainter and cooler (Binney & Merrifield 
1998, and references therein). We illustrate the phys- 
ical conditions underlying the M / L ratio gradients in 
Figure 8 for the snapshot 480 Myr after the merger. At 
this time, the remnant has a specific star formation rate 
(3.7 x 10 -11 yr _1 ) and broad-band colors similar to those 
of observed quiescent galaxies. Plotting stellar age as 
function of distance from the stellar particles to the cen- 
ter (Figure 8a), three loci in age can be distinguished, 
corresponding to stars formed in the progenitor disks, 
during first passage, and during final coalescence. The 
radial position in the merger remnant is clearly correlated 
with the epoch at which the stars were formed. Figure 8a 
also illustrates that the low level trickle of star formation 
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Fig. 6. — Ratio of half-light to half-mass radius as function 
of time since the merger, as determined from the intrinsic 
(unattenuated) photometry (left panels) and the attenuated 
photometry (right panels). Mass-to-light ratio gradients orig- 
inate due to a combination of age, metallicity, and extinction 
gradients that depend on time, wavelength, and viewing an- 
gle. The half-light radius of quiescent merger remnants mea- 
sured in the rest-frame V-band is typically larger by a factor 
1.5 to 2 than the projected radius containing half the mass. 
This ratio increases to a factor 2-3 in the rest-frame [/-band. 
Our result implies that the observed compact quiescent galax- 
ies (see, e.g., vD08), if formed by a similar gas-rich merger 
process, may be even more compact in terms of mass than 
previously assumed. 



present after the nuclear starburst happens mostly out- 
side the stellar half-mass radius (r e ^n — 0.9 kpc). This 
is a secondary effect contributing to the larger extent of 
the light profile compared to the mass profile. Figure 
8b demonstrates the presence of a negative stellar metal- 
licity gradient, of slope A ^°^ ^ = —0.35. This slope 
is a factor 1.5 to 2.5 steeper than that of typical mas- 
sive early-type galaxies today (Rawle, Smith & Lucey 
2010; Kuntschner et al. 2010), although those exhibit a 
large scatter in gradients. Together with their compact 
nature (§3) and fast rotation (§4), this implies the gas- 
rich merger remnants at z ~ 2 cannot evolve passively 
into present-day massive ellipticals. Subsequent (dry) 
merging has been proposed as a mechanism to slow the 
rotation, grow the size, and dilute the metallicity gra- 
dient, with the extent of the dilution dependent on the 
properties of the merger (Di Matteo et al. 2009). 

In addition to stellar age and metallicity, the gas and 
dust distribution and extinction column towards stellar 
particles also vary radially. The remaining gas in the 
1 < r < 30 kpc range has a negative metallicity gradient 
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Fig. 7. — Idem as Figure 6, but with photometry computed with 
the SUNRISE radiative transfer code. 



(Figure 8c), while at even larger radii the SPH gas parti- 
cles show a wide spread in metallicities. The normalized 
cumulative distribution of dust (taken to be proportional 
to the metallicity- weighted gas distribution) is contrasted 
with that of stars and gas in Figure 8d. The correspond- 
ing spherically averaged density profiles, normalized to 
the density at 1 kpc, are presented in Figure 8e. The 
gas is distributed over scales that are an order of mag- 
nitude larger than that of the stellar distribution. Both 
gas and dust (~ metallicity-weighted gas) reach a peak 
density near the stellar half-mass radius (r e ^ry = 0.9 
kpc). The tail of young stars at ~ 1 kpc visible in Fig- 
ure 8a shows that it is also here that the small amount 
of star formation that is still present after the merger 
(SPR _ 3 7 x 1Q -ii yr -i) takes p i ace Q as t h at dur- 
ing earlier phases resided within this radius has largely 
been consumed by star formation, or has been blown 
out by feedback processes from supernovae and AGN. 
Although this is not visible in the spherically averaged 
density profile of Figure 8d, the central cavity devoid of 
gas and dust is not spherically symmetric, but rather ax- 
isymmetric. The resulting radial distribution of optical 
depths towards the stellar particles therefore depends on 
the line of sight. We illustrate this distribution for two 
characteristic viewing angles in panel (f) and (g). For the 
viewing angle presented in panel (f), the effective atten- 
uation of the region within the half-mass radius is larger 
than that outside. Consequently, the attenuated y-band 
half-light radius viewed from this angle is 4.8 times larger 
than the half-mass radius. For the viewing angle of panel 
(g), the opposite is the case and the resulting attenuated 
TZ-band half-light radius is smaller than the half-mass 
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Fig. 8. — Radial stellar population profiles of a merger sim- 
ulation snapshot during the spheroid phase, half a gigayear 
after final coalescence. We show (a) the distribution of stel- 
lar age as a function of distance to the galaxy center, (b) the 
stellar metallicity increasing towards the center, (c) the radial 
dependence of the gas-phase metallicity, (d) the normalized 
stellar, gas and dust mass distributions, (e) their respective 
density profiles normalized to the density at r — 1 kpc, (f) 
and (g) the distribution of optical depths towards all stellar 
particles for two random viewing angles. 



radius by a factor 1.4. 

The right-hand panels of Figure 6 illustrate how the 
superposition of a dust, age, and metallicity gradient 
translates into the light-to-mass size ratio r e ji g ht/r e , mass , 
and how this quantity depends on time relative to the 
merger, line of sight, and wavelength. During the nu- 
clear starburst, the M/L ratio gradient is completely 



reversed with respect to the intrinsic (unattenuated) 
M/L ratio gradient, because of central dust obscuration. 
The median light-to-mass size ratio then drops, reaching 
r e, light /r e , mass values below 1 around 300 Myr after the 
peak in star formation rate, as the central dust content 
decreases while the young, massive stars in the center 
are still alive. Finally, as the system reaches a quiescent 
remnant phase (t s incc ^ 500 Myr), the age gradient has 
faded and the combination of a negative extinction and 
metallicity gradient results in typical half-light radii of 
2-0lg;g, 1.1+0 21 an d 0.7lg;o times the half-mass radius at 
isincc = 500 Myr in rest-frame U, V , and J respectively. 
At tsincc = 1 Gyr, the half-light radii in the rest-frame U-, 
V-, and J-band are 3.4l§;°, 1-8±J;|, and 0.8±g;g times the 
half-mass radius. Here, the error bar indicate the central 
50% percentile of the scatter due to line-of-sight varia- 
tions. Typically, the distribution shows an extended tail 
towards large relight /?~e, mass ratios. Both the line-of-sight 
scatter and the median value of r e ,iight/^e,mass increase 
towards shorter wavelengths. 

Computing the synthetic photometry with the inde- 
pendent radiative transfer code SUNRISE (Figure 7) 
gives results that are qualitatively consistent with those 
obtained from the LOS code. SUNRISE predicts typical 
light-to-mass size ratios of 3.6, 2.4, and 1.3 in rest-frame 
U, V, and J respectively during the quiescent phase. The 
sightline dependence shows a similar behavior as seen in 
the results from the LOS code, i.e., with a tail towards 
large r^rightAe.mass ratios (ratios of 5 and above). 

We tested the dependence of our results on the adopted 
attenuation law, on the assumed age and metallicity of 
stars present at the start of the simulation, and whether 
an age and/or metallicity gradient was already in place 
in the initial progenitor disks, and find consistent results 
with variations much smaller than those from sightline 
to sightline. Likewise, similar results were obtained from 
simulations where the progenitor disks were merged with 
a different orbital configuration. 

Peirani et al. (2010) simulate minor mergers between 
an elliptical galaxy and a satellite (spiral galaxy), and 
find qualitatively similar stellar age gradients: the young 
stars that formed by dissipational processes reside in the 
center of the remnant. The evolution of half-light radii 
presented by Peirani et al. (2010) resembles that seen in 
intrinsic light in our major merger simulations. However, 
the dust columns, and therefore the impact of attenua- 
tion on the observerd M/L gradients, are larger for the 
major than for the minor merger simulations. The as- 
sumption of a fixed dust-to-gas rather than dust-to-metal 
ratio, and the absence of AGN feedback in the Peirani et 
al. (2010) simulations may also contribute to differences 
in the predicted M/L gradients. 

In summary, we find that the interplay between an age, 
metallicity, and dust gradient manifests itself as a M/L 
ratio gradient that depends on time, line of sight and 
wavelength. Since an observer can only probe the latter 
parameter directly, the detection of color gradients show- 
ing preferentially red cores in compact quiescent galaxies 
would support the presented scenario. Moreover, from 
the simulations we expect the internal color gradients to 
be correlated with the integrated rest-frame optical color 
(see Figure 9). We stress that the presence of a red core 
does not necessarily imply inside-out growth. In fact, in 
this scenario, and in the absence of subsequent merging 
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Fig. 9. — Ratio of Crest to V re st half-light radius as function of 
integrated color. The central line indicates the median relation, 
whereas the light- and dark-gray polygon indicate the 50% and 
100% percentiles of distribution over 100 lines-of-sight uniformly 
spread over a sphere. 



(see §6.3), the bulk of the youngest stars resides in the 
center, but the effect of the age gradient on the internal 
color profile is compensated by the presence of metallic- 
ity and extinction gradients. Finally, if negative M/L 
ratio gradients are indeed present, our results may imply 
that the observed quiescent galaxies may be an order of 
magnitude (<~ 2 3 ) more dense than previously inferred 
from HST/NICMOS observations (vD08). 

6. SURFACE BRIGHTNESS PROFILES 
6.1. The cuspiness of simulated merger remnants 

We established that gas-rich mergers are a viable mech- 
anism to assemble dense stellar systems of <~ 10 11 M Q 
and ~ 1 kpc size (§3), and that such a scenario would 
leave a color and M/L ratio gradient as imprint (§5). 
Now, we go beyond the zeroth-order structural measure- 
ment of size and compare how the surface brightness pro- 
files of simulated merger remnants compare to those of 
observed quiescent galaxies at z <~ 2. 

For this purpose, we use the mock F160W observations 
described in §2.3 that are matched in terms of PSF, pixel 
size, wavelength, redshift, and noise properties to the 
NIC2 observations of high- redshift spheroids by vD08. 
We analyze the real and mock observations in concert 
with the two-dimensional fitting code GALFIT (Peng et 
al. 2002) using identical settings. In Figure 10, we show 
postage stamps of the real NIC2 images by vD08. Next to 
each image, we plot the residual images after subtracting 
the best-fit Sersic profile when leaving the Sersic index 
n free, or fixing it to n = 2, n = 3, n = 4, and n = 10 
respectively. We rank the objects by best-fit Sersic index 
and find they span a large range from n — 4.5 to n = 
0.5, reproducing the vD08 results. In most cases, fixing 
the Sersic index to n = 10 leads to significantly larger 
residuals than leaving n free. 

Repeating the analysis for random viewing angles of a 
simulated merger remnant (Figure 11), we obtain strik- 
ingly different results. For the majority of sightlines, we 
find that exponential (n = 1) and even de Vaucouleurs 



(n = 4) profiles provide a poor fit to the mock data. 
Their n nx = 2 — 4 residual images (illustrated in the 
top 6 rows of Figure 11) are characterized by a central 
positive peak, surrounded by a negative ring, outside of 
which a positive wing is barely visible above the noise. 
Our results remain unchanged when running GALFIT 
with the sky level left as a free parameter. The origin 
of this characteristic pattern becomes clear when con- 
sidering a one-dimensional representation of the surface 
brightness, free of the PSF, noise, and pixelization effects 
of the mock observations (Figure 12a). Here, the black 
solid line indicates the median surface brightness pro- 
file of 100 sightlines uniformly distributed over a sphere. 
The dashed and dotted gray lines decompose the light 
into that of stars formed prior and during final coales- 
cence respectively. As discussed extensively by Hopkins 
et al. (2008b; 2009b), the young component produces a 
central cusp, and the combination of the young and old 
component yields a profile that is poorly characterized by 
a simple Sersic index. The colored curves in Figure 12(a) 
illustrate Sersic profiles with a total luminosity and half- 
light radius identical to that of the simulated merger rem- 
nant. The radially alternating positive-negative-positive 
residual pattern is immediately apparent, as is the ten- 
dency to fit high values of n. It is important to note that 
this tendency is driven both by the presence of a central 
cusp (the amplitude of which depends on the amount of 
gas present at final coalescence) and by the presence of 
the strong wings to the profile (the amplitude of which 
is determined by the amount of gas consumed during the 
progenitor phase). 

For a smaller number of sightlines (illustrated by the 
bottom 3 panels of Figure 11) a fiat residual image is 
obtained when fitting a Sersic index in the range 2 < 
n < 5. Obscuration of the central young component by 
dust is responsible for this sightline-dependent effect. 

We demonstrate our findings with more statistical ro- 
bustness (more lines of sight and timesteps, and sim- 
ulations with different orbital configurations) in Figure 
12(b). Here, we plot the profile shape (characterized by 
the best-fit Sersic index) as function of the half-light ra- 
dius recovered by GALFIT. Large black circles are the 
vD08 compact galaxies. They have a median n = 2.3, 
well below the n = 4 de Vaucouleurs profile that is char- 
acteristic for nearby spheroids, van Dokkum et al. (2010) 
attribute this evolution to minor merging building up the 
wings of the high-redshift compact nuggets over cosmic 
time. The Sersic profiles fitted to the simulated merger 
remnants (red dots) are significantly cuspier (higher n, 
adopting an upper bound of n = 10). For the subset 
(about one fifth) of sightlines where the central cusp is 
sufficiently obscured by dust, the inferred half-light ra- 
dius is several kpc, inconsistent with the observed com- 
pact systems. 

We draw a similar conclusion from a direct comparison 
of the surface brightness profiles of our simulated merger 
remnants to that of a massive compact quiescent galaxy 
at z = 1.91 in the Hubble Ultra Deep Field (Szomoru et 
al. 2010, see Figure 13). Szomoru et al. (2010) derive the 
deconvolved profile of this galaxy from very deep Wide 
Field Camera 3 (WFC3) imaging using a novel technique 
that corrects the best-fit Sersic profile with the residual of 
the fit to the observed image. They find that the surface 
brightness profile is well approximated by an n = 3.7 
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Fig. 10. — NIC2 F160W postage stamp images of the vD08 sample of quiescent galaxies, and residual images after subtracting the best 
fit when leaving n free, or fixing it to n — 2, 3, 4, and 10 respectively. The panels are sized 3" X 3", and for clarity the contrast of the 
residual images is double that of the observed image. Typically, the observed quiescent galaxies are best represented by Sersic profiles with 
n < 4. 
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Fig. 11. — Mock NIC2 F160W observations of simulated merger remnants. The panels are 3" on a side, corresponding to 24.6 
kpc at z = 2.3. For 80% of the sightlines, the surface brightness profile is best fit by a cuspy profile (fit runs into the upper 
bound of n = 10) . Significant residuals remain when fixing the Sersic index to lower values. For about one fifth of the sightlines 
(illustrated in the bottom 3 rows) lower n are obtained, but the recovered half-light radius for these sightlines is larger than 
that of observed quiescent galaxies by a factor of ~ 5. 
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Fig. 12. — (a) Median rest-frame V-band surface brightness profile of a simulated merger remnant (black solid line), decomposed into 
a young (formed during the final burst, dotted gray line) and old (formed prior to final coalescence, dashed gray line) stellar component. 
In color, we overplot Sersic profiles of identical total luminosity and half-light radius for Sersic indices n = 2, 3, 4, and 10. The simulated 
merger remnant has a different nature than a pure Sersic profile, but is better approximated by a high value of n than a low n. (b) Sersic 
index as function of half-light radius as derived from GALFIT two-dimensional fitting, for the vD08 sample (large black symbols), and 
binary merger (red dots) and isolated disk (blue dots) simulations of similar mass placed at a similar redshift (z = 2.3). Neither merger nor 
disk simulations simultaneously reproduce the size and profile shape of the observed quiescent galaxies. 
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Fig. 13. — Direct comparison of the surface brightness profiles of 
simulated merger remnants with the deconvolved profile of a mas- 
sive, compact galaxy at z = 1.91 in the Hubble Ultra Deep Field 
(Szomoru et al. 2010). The profile of the observed quiescent galaxy 
falls off more rapidly at large radii than the simulated profiles. 



Sersic profile. Typically, our simulated merger remnants 
have a higher surface brightness in the r > 2 kpc wings 
relative to the surface brightness in the 0.2 < r < 2 kpc 
range. 

The ability to correctly recover the true half-light ra- 
dius known from the simulation depends on the upper 
bound set on the Sersic index. For n max = 10, we find 
a systematic overestimate of the size by ~ 40%, whereas 



setting n max = 4 improves the size measurement to about 
~ 10% in the median, with a scatter in of ~ 0.3. So, 
whereas the mock images for the majority of sightlines 
are best fit by high values of n, these best fits do not 
properly recover the true half-light radius of the system. 
This emphasizes the fact that an individual Sersic profile 
poorly describes its structure. 

We assessed the impact of observational depth by ar- 
tificially decreasing the signal-to-noise ratio of our mock 
observations (in practice, we reduce the source bright- 
ness, and add them to empty regions of the same NIC2 
images). Running GALFIT with n max = 10, this yields 
an increase in both the systematic size overestimate and 

the scatter in over a range of viewing angles. Adopt- 
ee 

ing n max = 4, we find that, in the median, the recovered 
size is consistent with the true size over the full range 
of input source magnitudes 21 < magFi60W < 24. How- 
ever, the scatter in does increase by a factor 2-3 as 
we compare sources with magFi60W = 24 to sources of 
identical profile shape with magFi60W = 21. We caution 
that different biases may be at play in the size measure- 
ment of observed compact quiescent galaxies such as the 
vD08 sample. Their intrinsic profile shape (i.e., noise- 
free and at infinite resolution) is unknown, but we did 
demonstrate it differs from that of our sample of sim- 
ulated merger remnants as they translate into different 
(mock) observed properties. As an independent test, we 
also inserted pure Sersic profiles of ~ 1 kpc size, with 
1 < n < 4 and a range of luminosities into empty regions 
of the NICMOS images. Running GALFIT on them, 
we find qualitatively similar results regarding potential 
biases in size measurements. The scatter in the recov- 
ered size increases with decreasing signal-to-noise ratio. 
Whereas profiles of similar magnitude as the galaxies 
from vD08 (21 < ma,g Fleow < 22) were recovered cor- 
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Fig. 14. — Idem as Figure 12, but for a series of simulations where the progenitor properties were varied. Vertical lines at the bottom of 
panel (a) indicate the half-light radii in the rest-frame V-band for which the surface brightness profiles are plotted. Dotted vertical lines on 
the top of panel (a) indicate the radii containing half of the rest-frame [/-band light. When running GALFIT on mock NIC2 observations 
of the simulations placed at z ~ 2.3, the inferred Sersic index and size (b) span a different part of parameter space than covered by the 
vD08 sample of observed quiescent galaxies (large black circles). 



rectly, systematic overestimates of the size by a factor 
1.06 (1.2) in the median and 1.1 (2.2) in the mean are 
found when decreasing the brightness of non-exponential 
(2 < n < 4) sources by 1 (2) magnitudes. In other words, 
perhaps counter-intuitively, size measurements from 2D 
surface brightness fitting on low signal-to-noise data may 
be biased towards size estimates that are too large, rather 
than too small. 

Computing the mock observations of our simulated 
merger remnants using the LOS and SUNRISE radia- 
tive transfer codes yields qualitatively similar results. 
Namely, that the simulated surface brightness profiles 
tend to be cuspier, and in general occupy a different lo- 
cus in (n, r e )-space than the observed compact galaxies 
at z ~ 2. Moreover, we confirm that this difference is 
notable even given the limitations in depth and resolu- 
tion of present-day observations. In detail, we note that, 
for a given simulation, the LOS code produces cuspier 
profiles than SUNRISE, yielding significant residuals af- 
ter subtracting a n — 10 fit (e.g., rows 1-3 of Figure 11) 
whereas these are mostly flat when running GALFIT on 
SUNRISE images (e.g., rows 4-6 of Figure 11). 

Finally, we carried out a similar procedure on mock im- 
ages of simulations where we let disk galaxies evolve in 
isolation. The initial disks are designed to have an expo- 
nential mass profile. We checked that GALFIT correctly 
recovers their n = 1 profile shape from mock images of 
the first simulation snapshot (values of 0.6 < n < 1 are 
recovered for some sightlines due to dust obscuration in 
the central region) . Eventually, since gas is consumed by 
star formation and we did not allow for gas replenish- 
ment, the simulated disk galaxies also reach low specific 
star formation rates. By the time this quiescent phase 
is reached, the disks have evolved in such a way that we 
find a range of Sersic indices similar to that of the vD08 
sample (see Figure 12b). However, in all cases, their re- 
covered size is a factor of several too large to reproduce 
the observed quiescent galaxies. 



6.2. Dependence on progenitor properties 

We now consider a few variations on the formation 
scenario for compact quiescent galaxies, with an empha- 
sis on how they alter the consistency with observational 
constraints on size and profile shape. First, the appar- 
ent discrepancy in profile shape of the merger remnants 
may stem from inadequate assumptions on the structure 
of the progenitors from which they formed. Figure 14 
presents a quantitative, but by no means complete, ex- 
ploration of the vast parameter space in which progenitor 
properties can vary. First, we ran a simulation where the 
spin parameter of the dark matter halos was increased 
from A = 0.033 to A = 0.1 (green), with otherwise de- 
fault settings. The higher spin parameter has a stabi- 
lizing effect on the gas disk, lowering the star formation 
rate during the progenitor phase, and increasing the gas 
fraction at the time of coalescence (/ gas = 0.64 com- 
pared to /gas = 0.42 for our fiducial model). The result 
is an even more concentrated remnant. Less stars formed 
prior to the merger, but they were redistributed in such a 
manner that it is mainly at intermediate radii (around 1 
kpc) that the surface brightness profile is devoid of light 
with respect to our default simulation. This is where the 
residual to a Sersic fit with 2 < n < 4 was already nega- 
tive (see Figure 12a), causing the best fit to run into the 
upper bound set on n again. Second, we investigate a 
simulation with the same total baryonic mass, but with 
progenitor galaxies that are built up of a stellar bulge 
in addition to a gas-rich disk, each contributing half of 
the baryonic mass (red). Secular formation processes to 
grow such a bulge by coalescence of clumps in gas-rich 
disks (i.e., without invoking additional merging) have re- 
cently been proposed by Genzel et al. (2008), Elmegreen, 
Bournaud & Elmegreen (2008, and Dekel et al. (2009b). 
We find that merging the bulge+disk systems leads to 
a compact (r e v>est — 1 kpc) remnant with significantly 
smoother central profile, but the strong wings remain in 
place. Analyzing its mock observations with GALFIT 
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yields a locus in (n, r e )-space that does not overlap with 
the vD08 sample (see Figure 14b). 

The same conclusion can be drawn from a third and 
fourth variation where we increase the disk dispersion 
twofold (orange) and adjust the merger mass ratio from 
equal mass to 1:3 (blue) respectively. The latter varia- 
tions are inspired by the small - = 2 — 4 ratios of star- 
forming disk galaxies at z ~ 2 observed by the SINS sur- 
vey (Forster Schreiber et al. 2009), and by the paucity of 
equal-mass mergers compared to mergers of mass ratio 
1:3 (Naab et al. 2007; Guo & White 2008; Hopkins et 
al. 2010b). We do not consider smaller mass ratios as 
they would not quench the star formation and produce 
a quiescent spheroid. 

Finally, we considered the effect of adopting a lower 
pressurization of the ISM (qeos — 0.35 instead of 
Qeos — 1 m the nomenclature of Springel, Di Matteo & 
Hernquist 2005). This leads to enhanced star formation 
in the outer regions during the early stages of the merg- 
ing process, resulting in more pronounced wings in the 
remnant. This is in qualitative agreement with Bournaud 
et al. (2010) who find that the remnants of mergers of 
clumpy disks are cuspier (i.e., have higher Sersic indices 
and more material at large radii) than those produced by 
merging disks in which the effective ISM is not allowed 
to fragment so completely. 

Overall, the considered variations in progenitor prop- 
erties have only a modest impact on the structure of the 
merger remnants: they occupy a similar region in the 
size-mass diagram, and have M/L ratio gradients that 
translate into similar color gradients (i.e., red cores; the 
dashed and solid lines in Figure 14a indicate the rest- 
frame U and V half-light radii respectively). Moreover, 
whereas the amplitude of the central cusp shows a de- 
pendence on the specific variations in progenitor struc- 
ture explored here, the extended outskirts of the light 
distribution are present in each of the above model vari- 
ations. As discussed in Section 7, a wider exploration 
of parameter space, involving physics leading to relative 
changes in the star formation efficiency between low- and 
high-density environments, may be appropriate to struc- 
turally alter the outer regions of the merger remnants. 

6.3. The impact of subsequent merging 

In this section, we address whether subsequent merging 
can alter the structure of the simulated merger remnants 
in such a way that their profile shape shows a closer re- 
semblance to that of observed quiescent galaxies. To this 
extent, we run a set of re-merger simulations in which 
the remnant of the initial gas-rich merger collides with a 
star-forming galaxy of lower mass. We consider mass ra- 
tios 1:6 to 1:2, and initial gas fractions of 80%, 40% and 
5% for the star-forming galaxies with which the merger 
remnant interacts. The resulting surface mass densities 
are presented in Figure 15. A more extended analysis, 
involving merger rates as function of mass, mass ratio, 
redshift and (redshift-dependent) gas fraction is required 
to properly model spheroid evolution (Khochfar & Silk 
2006b; Naab et al. 2009; Hopkins et al. 2010a), but for 
our present purpose it is sufficient to highlight two char- 
acteristic features notable in our re-merger simulations. 

First, the re-merger is consistently larger than the orig- 
inal merger remnant. The relation between the half-mass 
radius before and after depends on the nature (mass ra- 
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Fig. 15. — Surface mass density profile of a gas-rich merger 
remnant {black), and the products after re- merging it with star- 
forming galaxies of varying mass and initial gas fraction {colored 
curves). Half-mass radii are indicated by the vertical lines. Sub- 
sequent merging of a compact gas-rich merger remnant leads to a 
further build-up of its outer regions while the central dense cusp is 
little affected. This leads to an increase of the size, an effect that 
is most pronounced when the amount of dissipation is minimal. 



tio, gas fraction, ...) of the re-merger. As expected, the 
size growth is larger for the 1:2 than for the 1:6 mass 
ratios. At a given mass ratio, the growth is more pro- 
nounced when the amount of dissipation is minimized 
(i.e., dry merging is more efficient at expanding galax- 
ies). 

Second, the re-merger remnant is not homologous to 
the original gas-rich merger remnant (see also Hopkins et 
al. 2009c; Hopkins & Hernquist 2010). Typically, we see 
the build-up of an outer wing while the dense central cusp 
is only little affected. This effect is more pronounced as 
we consider re-mergers with a lower dissipational frac- 
tion. The net effect is that, if anything, higher values 
of n are preferred in fitting the products of subsequent 
merging. In addition, their size at a given mass grows 
too big to be consistent with the observed spheroids at 
z ~ 2. While subsequent (minor) mergers might well 
drive the structural evolution of spheroids from z ~ 2 to 
the present day (Naab et al. 2009; van Dokkum et al. 
2010; Hopkins et al. 2010a; Carrasco et al. 2010), they 
are not a viable mechanism to produce the z ~ 2 com- 
pact galaxies in the first place. Even if such an external 
process would yield sizes and profile shapes that matched 
the observations, it would not have offered a satisfactory 
solution, since it is unlikely that every quiescent galaxy 
observed at z ~ 2 experienced a second interaction after 
the initial quenching event. 

6.4. Stellar mass loss 

In light of the last comment, it is interesting to con- 
sider internal, rather than external, processes that may 
alter the light distribution. One such process is stellar 
mass loss. The simulations presented in this paper were 
run with GADGET-2's default settings that account for a 
small amount of mass loss. Ten percent of the gas mass 
converted into stars is instantaneously returned to the 
interstellar medium (ISM, Springel & Hernquist 2003). 
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With it, energy is injected in the surrounding ISM to 
model the effect of short-lived stars that explode as su- 
pernovae. For currently favored stellar initial mass func- 
tions (IMFs) such as a Kroupa (2001) or Chabrier (2003) 
IMF, however, eventually as much as half of the gas mass 
initially converted into stars is returned to the ISM. Ini- 
tially, the mass loss proceeds fast and in an explosive 
fashion, reaching 10% in 7 Myr and 25% in 60 Myr. At 
later times, AGB stars drive more gradual winds, in- 
creasing the released fraction to <~ 40% after a gigayear 
(Bruzual & Chariot 2003). In principle, three things can 
happen to the mass lost due to winds: either it escapes 
the galaxy's potential entirely, or it settles within the 
galaxy and stays there as gas, or it is consumed by star 
formation again. In addition, the redistribution of bary- 
onic mass due to mass loss can potentially change (lower) 
the central potential, causing the distribution of stars to 
adjust (expand) accordingly. Fan et al. (2008) propose 
such a scenario of adiabatic expansion due to mass loss 
driven by AGN and/or stellar feedback as a mechanism 
for spheroid growth between z ~ 2 and z ~ 0. Others 
(Bezanson et al. 2009; van Dokkum et al. 2010; Hopkins 
et al. 2010a) have objected to this process as a driver of 
structural evolution since z ~ 2, amongst other reasons 
because most of the mass loss is expected to take place 
within 500 Myr after the stars are formed, i.e., prior to 
the phase when the galaxies are observed as being qui- 
escent. The same timescale argument works in favor of 
our present discussion. If mass loss modifies the surface 
brightness profile substantially, it will have happened by 
the time we observe the galaxies as being quiescent. 

The precise effect of such extensive mass loss is how- 
ever not trivial to predict, as the energetics, the amount 
of mass loss, and the location of the stars when they lose 
mass all depend on time. Moreover, the timescales of 
stellar evolution (i.e., mass loss), star formation history 
of the system, and merger dynamics all overlap. It will be 
interesting to investigate in future numerical work how a 
time-dependent treatment of extensive stellar mass loss 
and its feedback affects the outcome of simulations as 
the ones studied here. Since this is beyond the scope of 
this paper 3 , we now limit ourselves to simple toy mod- 
els addressing the impact of, e.g., feedback on how the 
simulated quiescent galaxies would be observed. 

6.5. Feedback, radiative transfer, and preventing star 
formation at early times 

In these toy models, we apply a simple transforma- 
tion to a simulation snapshot of a merger remnant: an 
expansion by a fixed factor of the central component 
where the young stars dominate (Figure 16a), and addi- 
tional extinction that decreases (Figure 16c) or increases 
(Figure 16e) with radius. The first toy model may be 
thought of as an ad hoc implementation of feedback, 
whereas the latter two explore dramatic differences in 
the effect of radiative transfer. The corresponding di- 
agrams of recovered Sersic index versus recovered size 
when feeding mock NIC2 observations of the toy mod- 
els to GALFIT are presented in the right-hand panels. 

3 We did test that, ignoring any time dependence of mass loss, 
increasing the fraction of material instantaneously returned to the 
ISM from 10% to 25% barely changes the surface brightness profile 
of the remnant. 



The color-coding traces the expansion or extinction fac- 
tor, where the legend quotes the extinction at r 1 / 4 = 0.5 
kpc 1 / 4 and r 1 / 4 = 1.5 kpc 1 / 4 for the radially decreasing 
and increasing extinction respectively. 

Figure 16(b) shows that, if a physical mechanism that 
is currently not or improperly modeled is capable of puff- 
ing up the central stellar cusp by over an order of magni- 
tude, the best-fit Sersic indices enter the regime of those 
of the observed compact galaxies. However, the same 
transformation would lead to a violation of the observa- 
tional constraints on size by a factor of several. 

Likewise, larger sizes are inferred from toy model im- 
ages where the central region is heavily obscured (Figure 
16d). Moreover, as we already noted in the case of merg- 
ing bulge+disk progenitors (§6.2), a reduction of the cen- 
tral cusp without any change to the outer wings does not 
necessarily lower the Sersic index inferred from the mock 
observations of limited depth and resolution. 

A better correspondence with the locus of the vD08 
sample in (n, r e )-space is obtained when we lower the 
amplitude of the profile wings (Figure 16c). The plotted 
transformations reduce the true half-light radius (vertical 
lines in Figure 16e) by a factor 2 to 7, but the sizes 
inferred by GALFIT are of order 1 kpc. Although these 
toy models have observed properties similar to the vD08 
galaxies, the required transformation factors are large 
(an order of magnitude or more) and it is hard to think of 
a radiative transfer effect related to the dust distribution 
or intrinsic stellar population properties in the wings that 
can account for such a deviation from our fiducial model. 

There is another way of interpreting the last toy model. 
The outskirts are built up of old stars that formed in the 
progenitor disks and were redistributed by violent re- 
laxation during the merger. Consequently, limiting the 
amount of star formation at early times may lower the 
amplitude of the wings. In the binary merger simula- 
tions considered here, this is hard to accomplish since 
all gas is already present at the start of the simulation 
and the compact gas-rich disks are sensitive to gravita- 
tional instabilities giving rise to star formation. This 
is the case even for the fully-pressurized multiphase ISM 
model adopted throughout this paper (<7eos = !)• A way 
to circumvent this problem would be to delay the sup- 
ply of gas or spread it more continuously over time. In- 
deed, recent cosmological simulations suggest that galax- 
ies during the first 3 billion years after the Big Bang were 
not closed systems, and supply of gas through cold fila- 
ments and streams was a generic feature of their evolu- 
tion (Keres et al. 2005; Dekel & Birnboim 2006; Ocvirk 
et al. 2008; Dekel et al. 2009a). As such, reproducing 
not only the stellar population properties and size-mass 
relation, but also the profile shape of high-redshift quies- 
cent galaxies may require simulations in a cosmological 
setting. This is numerically very challenging as a high 
spatial and timestep resolution (see §2.1) remain essential 
to model the detailed structure of the merger remnants, 
as well as the accretion processes onto the supermassive 
black hole(s) that contribute to the quenching of their 
star formation. Consequently, a resimulation technique 
(see, e.g., Tormcn, Bouchet & White 1996) may be ap- 
propriate, as applied for example by Naab et al. (2007, 
2009) in the context of spheroid formation. 
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Fig. 16. — Idem as Figure 12, but for three toy models where we apply a simple transformation to a snapshot from a fiducial 
simulated merger remnant: (a-b) an expansion of the central region, (c-d) radially decreasing extinction, (e-f) radially increasing 
extinction. The high Sersic indices preferred in fitting the two-dimensional surface brightness profiles are driven by both the 
central cusp and the amount of light emerging from the extended wings of the profile. Only adjusting the central profile shape 
(a-d) is insufficient to match the vD08 galaxies in (n, r e )-space. A mechanism to reduce the amount of stars at large radii 
(typically formed in the progenitors long before final coalescence), or the amount of light emerging from them seems required. 
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Fig. 17. — Star formation history of a gas-rich merger simulation producing a compact remnant. The spectral energy distribution (SED) 
of simulated remnants (red in inset panel) provides a good match to the mean SED of the observed compact quiescent galaxies from vD08. 
White, blue and red circles in the log-scaled postage stamp show the circularized region containing half the total mass, U- and V-band 
light respectively. A mass-to-light ratio gradient is present in the simulated remnant, which typically results in larger half-light radii at 
shorter wavelengths. 



7. DISCUSSION AND SUMMARY 

The existence of ultra-compact, massive (~ 10 11 Mq) 
quiescent galaxies already in place merely 3 billion years 
after the Big Bang has been one of the great surprises of 
galaxy evolution studies in the last few years. Their dis- 
covery has triggered questions regarding their origin and 
fate, as well as the reliability with which their properties 
such as mass and size are characterized. 

In this paper, we address the formation and nature of 
such systems in the context of (gas-rich) merger scenar- 
ios. As such, this study is part of a larger effort that 
couples hydrodynamic simulations with radiative trans- 
fer in order to investigate how different types of observed 
high-redshift galaxies may tie together as stages of one 
evolutionary sequence. Figure 17 illustrates a typical star 
formation history of a gas-rich merger simulation. Gas- 
rich star- forming galaxies (blue) are abundantly present 
at high redshift (Erb et al. 2006; Tacconi et al. 2010). 
When in the process of merging (yellow), the morphol- 
ogy of such systems may (but will not always) appear 
disturbed. The timescales of a morphological merger sig- 
nature depend on mass ratio (Lotz et al. 2010a) and gas 
fraction (Lotz et al. 2010b) of the merger. Narayanan et 
al. (2010) describe how during a relatively short (< 50 
Myr) period at final coalescence ( orange) a peak in emis- 
sion at sub-millimeter wavelengths is reached. Continu- 
ing over a somewhat longer (~ 100 Myr) timespan, the 



system may be identified as a so-called dust-obscured 
galaxy (DOG), characterized by its large 24 /im to opti- 
cal flux ratio, although not every DOG, and particularly 
those with low 24 /im luminosity, is a gas-rich merger 
(Narayanan et al. 2009). Shortly after the peak in star 
formation rate (at t S i nce = 64 Myr for the particular 
simulation shown), the accretion rate onto the central 
supermassive black hole peaks. The role of merging in 
galaxy - black hole coevolution has been discussed exten- 
sively by Hopkins et al. (2006, 2008a). Finally, Wuyts 
et al. (2009b) investigated the role major mergers may 
have played in producing the above mentioned compact 
systems, in terms of their number and mass densities as 
well as their integrated stellar population properties. It- 
erating on the latter aspect, the inset panel in Figure 
17 contrasts the distribution of rest-frame UV to near- 
infrared SEDs of merger remnants computed from our 
simulations (median, 50th and 100th percentiles are dis- 
played in red) to the mean rest-frame SED of quiescent 
galaxies from the vD08 sample (Muzzin et al. 2009b, 
black data points, with error bars representing the error 
on the mean). The model and observations show an ex- 
cellent agreement over the full wavelength range probed, 
boosting confidence that the modeled stellar populations 
reflect reality at least in an integrated sense. 

In this paper, we focussed on the structure and re- 
solved stellar populations of merger remnants. We used 
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the sample of massive quiescent galaxies from vD08 as 
reference sample. Their near-infrared spectra are char- 
acterized by a Balmer/4000A break (Kriek et al. 2006). 
Deep NIC2 observations (vD08) have revealed their com- 
pact nature at the highest resolution currently available. 
Analyzing a suite of binary merger simulations of vary- 
ing mass, gas fraction, progenitor scaling, and orbital 
configuration, we confirm the idea originally proposed 
by Khochfar & Silk (2006a) that major mergers can ex- 
plain their location in the size-mass diagram provided 
they are gas-rich. Merging galaxies scaled to represent 
high-redshift star-forming disks, we find that systems of 
~ 10 11 Mq with half-mass radius ~ 1 kpc can be formed 
when the gas fraction by the time of final coalescence is 
about ~ 40%. Observational evidence for gas fractions 
of this magnitude has been accumulating in recent years 
(e.g., Tacconi et al. 2010). The corresponding velocity 
dispersions of these simulated massive compact galaxies 
are of order 300 - 400 km s _1 . They show considerable 
rotation (v ma j/a of up to unity) compared to the major- 
ity of lower redshift early- type galaxies, a result that has 
yet to be confirmed observationally. 

Running radiative transfer on the output of our SPH 
simulations, we find that the merger remnants have a 
radially dependent mass-to-light ratio. Typically, when 
observing the remnant 500 Myr to 1 Gyr after the peak 
in star formation rate, the half-mass radius is a factor 
~ 2 smaller than the rest-frame t^-band half-light radius. 
This implies that the high effective densities inferred 
from NICMOS and WFC3 observations of high-redshift 
quiescent galaxies may in fact only be lower limits. In the 
rest-frame [/-band, the typical light-to-mass size ratio in- 
creases to a factor 3-4. The ratio r ei iight/r eima ss shows a 
significant sightline-dependence, with a tail to high val- 
ues (> 5). Our conclusions are robust to the choice of 
radiative transfer code, input stellar population synthe- 
sis models, dust attenuation law, and whether or not the 
progenitor disks had stellar population gradients. The 
M/L ratio gradient is expressed as a color gradient that 
correlates with the integrated color of the system (red- 
der galaxies hosting a more pronounced red core). We 
caution that the presence of a red core cannot unambigu- 
ously be interpreted as evidence for inside-out growth. In 
fact, in the simulations studied in this paper, the central 
stellar population formed last, rather than first. The ef- 
fect of the negative age gradient is compensated by the 
fact that the stars inside the half-mass radius are more 
metal-rich, and suffer more extinction. 

Finally, we perform two-dimensional parametric fitting 
with the GALFIT code (Peng et al. 2002) on real and 
mock NIC2 observations with the same limitations of res- 
olution and depth. We find that our simulated merger 
remnants occupy a different region in (n, r e )-space than 
the vD08 quiescent galaxies. So do simulations of disk 
galaxies that are left to evolve in isolation. The high Ser- 
sic indices (n » 4) inferred from the mock observations 
are driven by both the central cusp (consisting of young 
stars) and the extended wings (consisting of old stars). 
We explored a number of variations in progenitor proper- 
ties that influence the amplitude of the central cusp, but 
have little effect on the outskirts. Subsequent (minor) 
merging tends to build up the profile wings even further 
while leaving the central cusp in the mass distribution 
relatively unaffected. 



By lack of cosmological context, the binary merger sim- 
ulations start with large amounts of gas in the progenitor 
disks (up to 80% of the baryonic mass content) in order to 
have a sufficiently large gas fraction by final coalescence 
to model the dissipational merger event. Given this large 
gas reservoir at the start of the simulation, it is hard to 
prevent it from forming stars and ending up forming the 
extended wings of the remnant profiles (wings that are 
not seen to that degree in the observations) . As a conse- 
quence, while gas-rich mergers as simulated in this paper 
reproduce the basic structural diagnostic (the size-mass 
diagram), second order structural properties such as pro- 
file shape may differ because too many stars are formed 
at early times. This could be prevented if the gas was 
not all present initially but instead accreted more grad- 
ually over time, or if star formation and/or stellar mass 
loss and its feedback operated in such a way that the 
efficiency of converting gas into stars was lower at low 
densities than assumed here (i.e., in the progenitor disks 
and particularly their outer parts). One could speculate 
that a low gas-phase metallicity, turbulence from accre- 
tion (Bournaud et al. 2010), or the elevated intensity of 
the ionizing background radiation at z ~ 2 inhibit the 
formation of molecular gas and thus stars during this 
early phase, especially at low densities where the gas may 
still be pristine. 

We demonstrate that, in addition to spatial resolution, 
timcstcp resolution is important in modeling the detailed 
structure of such extreme systems properly. Therefore, 
running spheroid evolution simulations in a full cosmo- 
logical context is computationally very expensive. Pio- 
neering efforts to simulate spheroid formation in a cos- 
mological context (without AGN feedback) have recently 
been carried out by Naab et al. (2007; 2009). These 
authors apply a resimulation technique on three target 
halos extracted from a low-resolution dark matter simu- 
lation to follow the formation and evolution of spheroids 
within those halos from high redshift to the present day. 
In agreement with our findings, Naab et al. (2007; 2009) 
argue that at high redshift dissipational processes on 
short timescales are a plausible mechanism to produce 
the compact galaxies observed at z <~ 2. At later times, 
they find that minor and gas-poor merging forms an ef- 
ficient means to build up an envelope around this core 
(see also Section 6.3 of this paper, and Hopkins et al. 
2009a). 4 

In the scenario investigated in this paper, the initial 
dissipational event occurs in the form of a major merger. 
Such a scenario connects three types of galaxies observed 
at high redshift in one evolutionary sequence: gas-rich 
disks, dusty starbursts with star formation rates peaking 
to <~ 1000 M /yr, and quiescent remnants. The dissi- 
pational event modeled by Naab et al. (2007; 2009) is 
driven by efficient radiative cooling and collapsing of gas, 
not necessarily involving major merging. It remains to be 
seen whether such a model can account for the peak star 
formation rates observed in, e.g., sub- millimeter galaxies, 
and the post-starburst nature of the stellar populations 
of compact galaxies. Observationally, measurements of 

4 In detail, as noted already by Naab et al. (2009), their sim- 
ulated spheroid with size r e = 2.4 ± 0.4 kpc and mass M = 
1.5 X 10 Mq lies in the low tail of the size distribution of ob- 
served nearby galaxies with similar mass. 
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clustering, from large scales to pair counts, based on wide 
and deep cosmological surveys could potentially provide 
support for a merger-driven evolutionary scenario. 
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